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Do debit cards decrease cash demand? Evidence from a causal 
analysis using Principal Stratification 
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ABSTRACT 

It has been argued that innovation in transaction technology may modify the cash holding be¬ 
haviour of agents, as debit card holders may either withdraw cash from ATMs or purchase 
items using POS devices at retailers. In this paper, within the Rubin Causal Model, we inves¬ 
tigate the causal effects of the use of debit cards on the cash inventories held by households 
using data from the Italy Survey of Household Income and Wealth (SHIW). We adopt the 
principal stratification approach to incorporate the share of debit card holders who do not use 
this payment instrument. We use a regression model with the propensity score as the single 
predictor to adjust for the imbalance in observed covariates. We further develop a sensitivity 
analysis approach to assess the sensitivity of the proposed model to violation to the key uncon- 
foundedness assumption. Our empirical results suggest statistically significant negative effects 
of debit cards on the household cash level in Italy. 
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sensitivity, unconfoundedness. 
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1 Introduction 


Since the early 1970s, the diffusion of payment eards, sueh as debit, eredit, pre-paid eards, 
raises the eoneern whether our soeieties are transforming into eashless soeieties. Cash has not 
disappeared in the meanwhile, however the interest in the question remains, partieularly for 
what eoneerns the wide diffusion of debit eards. Debit eards are defined as eards enabling 
the holder to have purehases direetly eharged to funds on his aeeount at a deposit-banking 


institution (C.P.S.S., 20011. The faet that debit eards allow withdraws at ATM points justifies 
the need of less eash holdings; at the same time the possibility to pay direetly at POS deviees 
at retailers makes this payment instrument a elose substitute for eash. Moreover, both of the 
operations, withdrawal and payment, involve very low, or most of the times null, eosts for 
debit eards eompared to the rest of noneash instruments. 

The possible effeet of the use of debit eards on eash holdings has also important impliea- 
tions for eentral banks as the sole issuer of eash. First, substitution of eash for eards eould lead 
to deerease in seignorage ineomes. Beeause banknotes represent non-interest-bearing eentral 
bank liabilities, early eoneerns emerged among polieymakers fearing that the eash shrinkage 
would have lead to a deeline in eentral bank asset holdings and, eonsequently, in seigniorage 


revenue (Stix 20041. Seeond, in low interest rate regimes, the eash-eard substitution ean be 
very sensitive to interest rate, and eonsequently interest rate adjustments as a monetary poliey 
to influenee bank lending may be diffieult (Markose and Lokel 2003 [ Yilmazkuday and Yaz 


gan 20091. The existing literature has been foeused on the impaet that payment eards may 


have on the funetional relationship of the eash demand to other variables that eentral banks use 
to implement monetary poliey ( |Duea and Whiteseli] |1995[ [Attanasio et al.\ |2002[ [Lippi and 


Seeehi[ |2009[ Alvarez and Lippij [2009 1 while relatively little attention has been direeted to 


quantifieation of the effeets of debit eards on the level of eash inventories held by individuals 
or households. 
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This paper aims to evaluate the eausal effeet of debit eards on eash inventories using data 
from the Survey on Household Ineome and Wealth (SHIW), a bi-annually national survey run 
by Bank of Italy on several aspeets of Italian household eeonomie and finaneial behaviour. 


We adopt the Rubin Causal Model (RCM) ( |Rubin[[T974[[T978] ) to eonduet the eausal analysis. 
Under the RCM, for eaeh post-treatment variable, eaeh unit has a potential outeome eorre- 
sponding to eaeh treatment level, and a eausal effeet on that post-treatment variable is defined 
as a eomparison between the eorresponding potential outeomes on a eommon set of units. A 
eritieal requirement of the RCM is that a “eause” (or a treatment without distinetion) must 
or at least eoneeptually be manipulatable - the prineiple of “no eausation without manipula¬ 
tion” ( |Holland[ |1986| ). This raises a eoneeptual ehallenge in our applieation beeause using 
debit eards is largely a voluntary aetivity, and it is not elear that we eould expose a person to 
use debit eards in any verifiable sense. Instead it is more natural to eoneeive the possession 
of eards—a status eontrolled by banks that issue eards—as the treatment variable. However, 
the primary researeh interest obviously lies in the effeet of using eards rather than possessing 
eards. In faet, a signifieant portion of Italian households in the SHIW who have debit eards 
do not use them. Moreover, even among eard users, there is various degree of usage: some 
use debit eards only to withdraw eash from ATM oeeasionally, while others use them fre¬ 
quently for both eash withdraw and payment at retailers. The issue is not limited to Italy, for 
example data from a survey eondueted in Austria in 2003 reports evident shares of non-users 


and different use frequeneies among debit eards holders (Stix 2004). Naturally there may be 
heterogeneous effeets of debit eards on eash holding among different groups of eard users. 


We propose to address these eomplieations via prineipal stratifieation (Frangakis and Ru¬ 


bin 2002), a unified framework for eausal inferenee in the presenee of post-treatment vari¬ 


ables. The key is to treat the possession of debit eards as the treatment and the use of eards 
as a post-treatment intermediate variable between the treatment and the outeome. Prineipal 
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stratification is a cross-classification of the population based on the joint potential values of 
an intermediate variable under each of the treatment, i.e., principal strata, and the interest lies 
in estimating causal effects local to certain prineipal strata. For example, the foeus in our 
applieation is the eausal effeet in the stratum of units who possess and use debit eards - the 
“eompliers”. This is similar to the instrumental variable approaeh to noneomplianee in ran¬ 
domized experiments ( Angrist et al.\ 1996[ Imbens and Rubin] 1997[ Hrrano et a/.[ |2000 ), a 
speeial ease of prineipal stratifieation. Recently a rapid growing literature has extended prin¬ 
eipal stratifieation to a wide range of settings in both experimental and observational studies, 
including “censoring by death” (Rubin 2006[ Mattei and Mealli[ |2007[ Zhang et al. 20091, 
missing data ([Mattei et al.\ |2014|), surrogate endpoints ([Gilbert et al.\ [2003^ [Li et al.\ [2009 


201 1[ ), mediation analysis ( [Gallop et al.\ [2009 [ [Elliott et~ar\ [2010[ ), and designs ( [Mattei and 


Mealli 2011[). More eomplex settings sueh as ordinal or eontinuous intermediate variables 


have also been explored (Frangakis et al. 2004 Jin and Rubin 2008; Griffin et al. 2008 
Sehwartz et a/.[[20TT] ). 


Beeause only one of the two potential outeomes is observed for eaeh unit, prineipal strata 
are latent, and thus identification of principal causal effects relies on assumptions sueh as 
uneonfoundedness and exelusion restrictions. Unconfoundedness is particularly crucial in our 
study given the observational nature of SHIW. We taekle this issue in two steps. First, we 
propose a model-based regression approaeh where, to reduee the risk of mis-specifieation due 
to the imbalance in the multiple eovariates between treatment groups, we use the estimated 
propensity score ( [Rosenbaum and Rubin[ [1983b[ ) as the sole predictor. Then, we design and 
eonduet a comprehensive sensitivity analysis on unconfoundedness: we eneode the degree 
of violation to unconfoundedness as sensitivity parameters in the assumed regression model 
and examine how estimates ehange over plausible range of the sensitivity parameters. This 


vein of sensitivity analysis in eausal inference originates from Rosenbaum and Rubin (1983a). 
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In particular, our method is built upon that of Schwartz et al. (2012), who elaborated the 
various pathways of confounding and examined the sensitivity to both unconfoundedness and 
exclusion restriction, in the context of principal stratification. Alternative sensitivity analysis 
approaches have also been developed in the literature (e.g. |Grilli and Mealli[ |2008^ [Sjolander 
et a/.i[2009||Jo and Vinokuri|2011t [Stuart and Jo1|2013t [Gilbert et al.\\2QQ3) . 

The rest of the article is organized as follows. In Section 2, we introduce the basic setup, 
estimand and assumptions under principal stratification. In Section 3, we propose a model- 
based approach for estimation and a sensitivity analysis under the model. Section 4 presents 
the data and the empirical results. Section 5 concludes. 


2 Basic setup under principal stratification 


2.1 Notions and estimands 


Because debit cards are typically issued to individuals, the natural statistical units would be 
individuals possessing debit cards in our analysis. However, SHIW collects information only 
on the household level. To mitigate the problem, we set household as the unit, but limit the 
sample of treated units to the households possessing one and only one debit card during the 
study period. This ensures a possible effect on household cash holding will be due only to the 
individual possessing the card, who is usually the head of the household. 

Consider the study sample consists of N units. For unit i, let Zi be the binary treatment, 
equal to 1 if the household possesses one and only one debit card and 0 otherwise; Di be the 
binary post-treatment variable, equal to 1 if the household uses a debit card and 0 otherwise]^ 
Yi be the outcome, the average amount of cash held by the household; and Xj be the set of 
pre-treatment covariates. Because utilization of debit cards is a post-treatment event, we can 


^We will define two different characterization for the use of debit cards in Section 4.1 
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define its eorresponding potential outeomes: let -Dj(l) and -Dj(O) be the potential debit eard 
usage status if unit i does and does not have a eard, respeetively, equal 1 if the unit uses 
the eard and 0 otherwise. Similarly, let Yi(l) and ^^(0) be the potential outeome, if unit i 
does and does not possess a debit eard, respeetively. These notations of potential outeomes 
imply the aeeeptanee of the Stable Unit Treatment Value Assumption (SUTVA; Rubin, 1980), 
that is, no interferenee between the units and no different versions of a treatment. SUTVA is 
deemed reasonable in this study, beeause the holding of debit eards in one household does not 
seem to affeet the potential debit eard utilization or eash inventory of other households. For 
eaeh unit i, we only observe the potential outeomes eorresponding to the observed treatment: 

D, = D,{Z,),Y, = Y,{Zi). 

A prineipal stratifieation with respeet to a post-treatment intermediate variable D isa parti¬ 
tion of units based on the joint potential values of D, i.e., prineipal strata: S'* = (A(0), A(l))- 
When both Z and D are binary, there are four prineipal strata in theory: 5, e {(0,0), (0,1), (1,0), (1,1)}. 
In this applieation, obviously one ean not use debit eards without possessing one, and also there 
are units who possess eards but not use them. As sueh, our study sample automatieally satisfies 
a strong ‘monotonieity’ eondition: 


Assumption 1 (Monotonicity). (1) A(0) = 0; (2) 0 < Pr(i9j = 0|Xj, Zi = 1) < I, for all i. 


Under monotonieity, there are only two prineipal strata: S'* = (0, 0) = n, units who would not 
use debit eards irrespeetive of whether possessing one, and Si = (0,1) = c, units who would 
use debit eards if possessing one but would not use if otherwise. We refer to these two strata 
as never-users and eompliers, respeetively, following the nomenelature of noneomplianee in 


Angrist et ah (19961. 


By definition the prineipal stratum membership Si is not affeeted by the treatment assign¬ 
ment. Therefore, eomparisons of V(1) and Y (0) within a prineipal stratum are well-defined 
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causal effects because they eompare quantities defined on a eommon set of units. Here our 
interest lies in estimating the eausal effeets for the treated eompliers, that is, units possessing 
and using debit eards. Thus we define the targeted estimand to be the average eausal effeet of 
the treated eompliers (CATT): 


CATT = E[F,(1)-F,(0) | 5, = c, Z, = 1] = E,|z=i{E[y;(l)-y;(0) \S, = c,Z, = 1, X, = x]}. 

( 1 ) 

Analogously we ean define the eompliers average treatment effeet (CATE), also known as the 


loeal average treatment effeet (LATE) Imbens and Angrist (1994): 


CATE = E[Fi(l) - y)(0) I 5, = c] = Ex{E[r,(l) - F^O) | 5^ = c, = x]}. (2) 


Both CATE and CATT are intention-to-treat (ITT) effeets, representing effeets of possessing 
debit eards, rather than effeets of using eards. To attribute these effeets to the use of eards, 
we make the following exelusion restrietion assumption for the eompliers, following the es¬ 
tablished literature in the IV approaeh to noneomplianee (e.g. Angrist etal.\ 1996; [Imbens and 


Rubin 1997): 


Assumption 2 (Exclusion Restriction for eompliers). For all units with Si = (0,1), the effect 
of card possession is only through using the card. 

A formalized version of this assumption, whieh requires double-indexed notations, is given 

(Assumption 23.4). This type of exelusion restrietion is in faet 
routinely made, often implieitly, in randomized experiments with full eomplianee. 


in ( Imbens and Rubin}[2015 


2.2 Identification of the causal effects 

Eor the same unit, only one of the two potential outeomes (i9j(0), Zi)j(l)) is observed, and 
thus the prineipal stratum Si is at most partially observed. The following assumptions are 
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necessary for establishing the nonparametric identifiability of CATT and CATE (Imbens and 


Angrist 1994). 


Assumptions (Overlap). 0 < Pr(Zi = l|Xj) < 1, for alii. 


Assumption 4 (Unconfoundedness). ^^(0), A(0)} -L 


Assumption 5 (Exclusion Restriction for never-takers) 


E[y;(l)|Xi,5i = n]= E[Fi(0)|Xi,5, = n]. 


Assumption]^ is the standard overlap condition, stating that each unit has a positive prob¬ 
ability of possessing a card. Assumption [^states that the treatment assignment is independent 
of all the potential outcomes conditional on observed pre-treatment variables; it is also known 
as the assumption of “no unmeasured confounders”. Assumptionj^states that possessing debit 
cards does not affect the outcome for never-takers. Though both are exclusion restrictions, 
Assumption and Assumption are of very different nature: the former is a necessary con¬ 
dition for identifying the causal effects, whereas the latter is made solely for interpreting the 
ITT effects as effects of the actual treatment received. More discussions on the difference can 
be found in Mealli and Pacini| ( |2013[ ) and |Imbens and Rubin| ( |2015] )(Chapter 23). 

In randomized experiments. Assumptions [3]|^ automatically hold without conditioning on 
covariates, and CATT equals CATE. In observational studies, unconfoundedness generally 
relies on conditioning on a number of observed covariates, and CATT and CATE are usually 
different. Consequently analysts often adopt regression models to adjust for the covariates to 
estimate the causal effects. However, two complications often arise, as in this application: first, 
distributions of some covariates can be significantly imbalanced between treatment groups. 


leading a regression analysis to rely heavily on model specification (Imbens 20041; second. 


unconfoundedness may still be questionable even conditioning on a large number of observed 
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covariates. To address the first issue, we propose to combine the propensity score method and 
regression adjustment. To address the second issue, we conduct a comprehensive sensitivity 
analysis around the regression models. 


3 Model-based estimation and sensitivity analysis 


3.1 Models 

In the context of principal stratification, six quantities are associated with each unit: Fi(0), 
li(l), A(0), Di{l), Zi, Xi. Under nonconfoundedness, the joint distribution of these quanti¬ 
ties can be written as: 

Pr(y,(0),F,(i),A(0),A(i),^*,x,) 

= Pr(r,(0), Fi(l)|5„ X,) Pr(^,|X,) Pr(Z,|X,) Pr(X,) 

= Pr(y,(0), e(X,)) Pr(^,|e(X,))e(X,) Pr(X,), (3) 

where e(Xj) = Pr(Zi = l|Xj) is the propensity score. In the analysis, we will condition on the 
observed distribution of covariates, so that Pr(Xj) does not need to be modeled. Therefore, 
three models are required for inference: one for the propensity score, one for the principal 
strata given the propensity score, and one for the potential outcomes given principal stratum 
and the propensity score. 

Compared to the nonparametric approach, the model-based approach is more flexible, can 
reduce bias and improve precision, and also offers conceptually straightforward ways to incor¬ 
porate complexities like multilevel structure, multiple outcomes, and latent variables. How¬ 
ever, parametric models on a large number of pre-treatment variables are also more sensi¬ 
tive to mis-specification ( Rubin[ 19791. In particular, imbalance in the pre-treatment vari¬ 
ables between treatment groups or between different strata renders causal effects estimation to 
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rely heavily on extrapolation, and eonsequently, on the funetional speeifieation. Beeause the 
propensity seore reduees the dimension from the spaee of eovariates to one, and balanee of the 
propensity seore leads to balanee of eaeh observed eovariate, an attraetive alternative is to use 
the estimated propensity seore as the sole eovariate (e.g. [Heekman et a/T| |1998| ). This method 
is not as effieient as the regression estimator based on adjustment for all eovariates when the 
model is eorreetly speeified (Hahn[ 1998), and one has to estimate the propensity seore, whieh 


is also subjeet to mis-speeifieation. However, simulation studies in Mereatanti and Li (2014) 
show that, in the presenee of eovariate imbalanee between treatment groups, mis-speeifieation 
of the regression model leads mueh larger biases and mean square error (MSE) than mis- 


speeifieation of the propensity seore. Moreover, in a reeent simulation-based study. Hade and 


Lu (2014) suggests that if the distributions of the estimated propensity seore in the treated and 
untreated groups have different shapes but roughly the same support, as in our applieation, 
then regression on the estimated propensity seore performs well eompared to the eonventional 
regression model on the entire set of eovariates and other propensity seore based methods 
(e.g. matehing, stratifieation and weighting). Therefore, given the relatively large number of 
eovariates in our applieation, we ehoose the regression on propensity seore approaeh. 

Sinee there are only two strata in our applieation, we use a logistie regression model for 
the prineipal stratum membership S: 


logit(Pr(S'j = n\Xi = x)) = ao + e{x) ■ a. 


( 4 ) 


And we assume a linear regression model for eontinuous potential outeomes, with different 
intereepts and slopes for different strata: 

Vi{Yi{z)\Si,Xi = x) = lsj=c • (/3co + z ■ 9c + e{x) ■ /3ci) + lsi=n • + e(a;) • /3„i) -f e^, 

( 5 ) 

where e* ~ iV(0, cx^) and is an indieator funetion that equals one if S'* = s and equals 
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zero otherwise. It is easy to show that 6c is the CATE, and the CATT ean be subsequently 
estimated by averaging the differenees between the observed outeomes for treated eompliers 
and their estimated eounterfactuals: 


MT 


EiA ■ Z.-|y.-(/3^ + e(Xi).fti)| 

E.A ■ A 


The maximum likelihood (ML) estimates of the parameters are obtained using an EM (expeetation- 
maximization) algorithm. In the E-step the unobserved principal strata are replaced by their 
expectations given the data and the current estimates of the parameters; then in the M-step, 
the likelihood conditional on the expected principal strata is maximized. Standard errors are 
obtained by the outer product of gradients evaluated at the ML estimate for the parameters in 
0 and and by the bootstrap for the CATT. 


3.2 Sensitivity analysis 


Our sensitivity analysis is in the same spirit of Rosenbaum and Rubin[ (1983a), where the 


assignment to treatment is assumed to be unconfounded given the observed covariates X and 
an unobserved covariate U, but is confounded given only X. Rosenbaum and Rubin suggest 
to specify a set of parameters characterizing the distribution of U and the association of U 
with Z and Y(z) given observed covariates. Assuming a parametric model, the full likelihood 
for Z, y(0), y(l), (7 given X is derived and maximized, fixing the sensitivity parameters to 
a range of known values, and the results are compared. In order to incorporate the additional 
complexity of the immediate variable in our setting, we adopt a simpler setup similar to that in 


Schwartz et al. (2012). In particular, we do not directly model the distributions involving U, 
instead we model the consequences of an unmeasured confounder. Specifically, in the presence 
of U, Equation (|^ no longer stands, instead we have 


Pr(y)(0), E,(l), S,\Zi, X„ U,) = Pr(E,(0), E,(1)|Z„ U,) Pr(^,|Zi, U,). 
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Therefore, even eonditional on the observed eovariates, the proportion of prineipal strata and 
the distribution of the potential outeomes within a prineipal stratum ean differ aeross treat¬ 
ment groups. These two ehannels of eonfounding are referred to as S'-eonfounding and V- 
eonfounding in |Sehwartz et al. (20121, and are eneoded in the following two models modified 
from (|^ and ([^, respeetively. 

The prineipal strata model (|^ is expanded to aeeount for the S'-eonfounding: 


logit(Pr(Si = njZi = z, Xi = x)) = ao + e{x) ■ a + ^ ■ z, 


( 6 ) 


where ^ is the log odds ratio of being a never-taker among eard-holders versus among non 
eard-holders eonditional on eovariates: 


Pr(Sj = n\ 

|Z, = l,Xi)/Pr(S, = c| 

\Zi — 1, Xi) 

Pr(Sj = n\ 

\Z, = 0,Xi)/PT{Si = c\ 

\Zi = 0, Xi) 


The parameter ^ is estimable from the observed data, but nevertheless ean also be viewed as 
a sensitivity parameter: when the uneonfoundedness assumption holds, the odds ratio should 
be 1 and ^ = 0, and eonsequently large absolute value of the estimated ^ suggests severe 
imbalanee in the proportions of prineipal strata between treatment groups and thus large S'- 
eonfounding. Here ^ is imposed to be the same aeross different values of X. We have also 
fitted the model with an interaetion term between X and Z but observed little differenee in our 
applieation. 

For the potential outeomes models, in the presenee of unmeasured eonfounding, it is impor¬ 
tant to differentiate the assignment 2 ; in the definition of potential outeomes and the observed 
assignment Z, and we make the distinetion using different subseripts zi and Z 2 . Speeifieally, 
we expand model ([^ by adding two sensitivity parameters as follows: 

Vi{Y,{z^)\Si,Z, = Z2,X, = x) 

= l5,=c ■ (/^cO + Zi9(. Z 2 rjc + e{x) ■ f3c) + lSi=n • (/SnO + ^2Vn + g{x) ■ /3„) -f 6^, (7) 
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It is straightforward to show that rjc and rjn account for the Y -confounding for compliers and 
never-takers, respectively: 

= Pr(F,(zi)|Z2, = 1,^. = = 0, = s,X,), 

for s = n,c. For parsimony, the model assumes that rjc and are eonstant aeross x and z. 
We have also eonducted analysis with interaetion terms between X and Z, and the results are 
similar. In our applieation, rjn (or rjc) is the differenee in eash holding between a never-user 
(or eomplier) who has a debit eard and a never-user (or eomplier) who does not have a eard. 
When the uneonfoundedness assumption holds, ?7c = = 0, and model Q reduees to model 

0 . 

The sensitivity analysis is earried out by comparing the ML estimates of the CATT while 
fixing the sensitivity parameters at a range of plausible values. Among the three parameters, 
(^5 Vc, Vn), ^ is estimable from the data and we fix it at the estimated value. Since for eaeh unit 
only the potential outeome eorresponding to the observed treatment is observed, zi = Z 2 in all 
observed units in Model Q, and thus only the sum 9c + rjc but not eaeh individual parameter 
is identifiable. For example, one ean not differentiate the two sets of parameters (^c, rjc) and 
{6c -\- v^rjc — v) for any v. Therefore, in the sensitivity analysis, we will vary the values of rjn 
while fixing rjc to 0 for convenience, and explore possible range of v in the interpretation. 

4 Application to the Italian SHIW data 

4.1 The Data 

The SHIW has been run every two years since 1965 with the only exception being that the 
1997 survey was delayed to 1998. We denote by t the generie survey year, and by {t + 1) 
the subsequent survey year. We define the target population as the set of households having 
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at least one bank eurrent aeeount but neither debit eards nor eredit eards at t. The treatment 
Z is posed equal to 1 if, at t + 1, the household (all members eombined) possesses one and 
only one debit eard and no eredit eards, equal to 0 if, at t + 1, the household possesses neither 
debit eards nor eredit eards. The households with more than one debit eard other than those 
possessing at least one eredit eard are exeluded from the sample. The reason to exelude the 
households holding eredit eards is that these households usually already possessed at least one 
debit eard ( |Mereatanril |2008| ), and thus inelusion of these households would lead to imbalanee 
in eredit eard holdings between treated and untreated households. Given that eredit eard is a 
payment instrument potentially affeeting eash demand, this would overestimate the effeet of 
debit eards. Therefore, a household for whieh Z = 1 is eharaeterized by having aequired their 
first (and only) debit eard during the span t —)■ (f +1). The two binary post-treatment variables 
D identify the way in whieh the debit eard is aetually used: D is posed equal to 1 if debit eard 
is used to make ATM withdrawals at least one time per month on average (withdrawers), or 
debit eard is used to make payment at POS deviees at least one time per month on average 
{POS users). For the post-treatment variable withdrawers, the relevant survey question asks 
the households how many withdrawals were made on average per month at ATM points during 
the survey’s year. For POS users, the relevant survey question asks the number of times, on 
average per month, the debit eard was used direetly at supermarket or shops to make payments 
by means of POS terminals. We eonduet separate analysis with eaeh of the two D variables. 
The outeome Y is the average eash inventory held by the household and is observed at f -f 1. 
The relevant survey question asks the sum of eash household usually have in the house to meet 
normal household needs. 

The eovariaties X inelude the lagged outeome, some baekground demographie and soeial 
variables referred either to the household or to the head householder. The subset of eovariates 
referred to the household ineludes the overall household ineome, wealth, the monthly average 
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spending of the household on all eonsumer goods, and the following eategorieal variables: the 
number of earners, average age of the household, family size, the Italian geographieal maero- 
area where the household lives, the number of inhabitants of the town where the household 
lives. Those related to the head householder inelude age and edueation, both of whieh are 


eategorieal. As shown in Mereatanti and Li (2014), the probability of a household having one 
debit card increases with income, the town size, education of the head householder, from the 
South to North of Italy, while decreases with the average age of the household. 

Table shows, for the years 1995, 1998 and 2000 a non-negligible share of debit card 
holders who rarely use the card to withdraw cash from ATMs or to pay for purchases at POS 
devices at retailers. The share of non-users is less for withdraws at ATMs than for POS pay¬ 
ments. 


Table 1: Per cent of households with bank account, possessing one debit card and no credit 
cards, by debit card usage. 


Year 

Sample size 

Less than one ATM withdrawal 

per month on average 

Less than one POS payment 

per month on average 

1995 

1727 

23.2 

87.2 

1998 

1645 

16.8 

68.7 

2000 

1857 

19.3 

59.2 


A simple descriptive cross-sectional analysis on the subsample of households observed in a 
single sweep of the survey shows the difference in average cash inventory between households 
possessing one debit card and households without a debit card is -121.6, -118.1, and -169.1 
thousands of Italian Liras in 1995, 1998, and 2000, respectively. Though not sufficient to 
establish causal effects of debit cards on cash holding, this shows that consumers in Italy who 
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possess debit eards hold less eash eompared to those who do not on average. 

SHIW is a repeated eross-seetion with a panel eomponent, namely only a part of the sample 
eomprises households that were interviewed in previous surveys. Our analysis will foeus on 
the households observed for two eonseeutive surveys. Table reports the samples sizes for 
eaeh span, f —>■ (f + 1), where t = 1993, 95, 98, respeetively by treated and untreated units. 
The relative frequeney of untreated units alongside the total sample size has a eonsiderable 
drop after 2000. Aeeordingly, the analysis will be limited to the span until 1998-00, the latest 
span with eonsiderable share of both untreated units and total sample size. 

Table 2: Sample sizes and relative frequeney of treated (Zi = 1) and untreated (Zi = 0) units 
for eaeh span. 


t —>■ (f -(- 1) 


= 1 


= 0 

Total 

size 

rel. freq. 

size 

rel. freq. 


1993-95 

164 

.Ill 

764 

.823 

928 

1995-98 

143 

.274 

379 

.726 

522 

1998-00 

114 

.182 

513 

.818 

627 


4.2 Assessment of covariate overlap and balance 

We first assess eovariate overlap and balanee in the studied sample. Figure [^presents the his¬ 
tograms of estimated propensity seores for treated and untreated groups for eaeh span, whieh 
were estimated from a logistie model with main effeets of eaeh eovariate. The histograms 
show a satisfaetory overlap in the support of the propensity seore for all three spans, so that no 
trimming is needed, and further, this provides basis for our propensity seore regression method 


based on the suggestions of Hade and Lu (2014). 
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Figurej^reports, for each span, the boxplots of the absolute standardized differences (ASD) 
in covariates between treated and untreated group: 


ASD 


Ef=i 




/ + sl/N„, 


( 8 ) 


where is the number of units and is the standard deviation of the covariates in group 
Z = 2 ; for 2 ; = 0,1. The boxplots reveal significant imbalance in a large number of covariates 
between groups. Therefore, adjustment of covariate imbalance is crucial in this application. 


Figure 1: Histograms of the estimated propensity score for the treated (blue) and the untreated 
(red). The first is for the span 1993 to 1995, the second is for the span 1995 to 1998, and the 
third is for the span 1998 to 2000. 





Figure 2: Boxplots of the absolute standardized difference of all covariates. 



span 1993—1995 Span 1995—1998 Span 1998—2000 
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4.3 Results 


We fit the models in Seetion via the EM algorithm. Table eompares the proportion of 
never-users among the treated units estimated from the model with and without the sensitivity 
parameter ealeulated as follows: 

Pr(^ = n\Z = l)= [ Pr(^ = n\Z = 1, e{x)) ■ Pr(e(a:)|Z = 1) dx, 

J e(x) 

where Pr(S' = n\Z = 1, e{x)) is the model for S and Pr(e(a;)|Z = 1) is approximated by 
the observed distributions of the estimated e{x) among the treated units. As a referenee, we 
also present the moment estimates of this quantity, whieh is the proportion of non-users in the 
group of households with one debit eard. Table shows that the moment estimates elosely 
resemble estimates from the model with the sensitivity parameter ^ (model (|^) in all spans, 
but differ mueh from those from the model without the parameter (model Q), suggesting the 
latter is subjeet to large bias due to unmeasured eonfounding (elaborated later). 

Table [^reports the results obtained from the sensitivity model with ^ free in equation (|^, 
and with rjc = 0 and varying values of rjn in equation (|7]). The parameter represents the 
level of Y -eonfounding for never-users by the differenee in average eash inventories between 
never-users with one debit eard and without debit eard. Absolute values of rjn greater than 
400 thousands of Italian Lira (LIT hereafter) is eonsidered too high given the reported values 
of average eash inventories, therefore we limit the range of examined values for T]n between 
-400 and 400 thousands LIT. The average eash inventories for never-users with one debit eard 
is observable and is in the ranges from 630 to 788, and from 703 to 990 thousands of LIT 
for POS users and withdrawers, respeetively. Estimates of ^ are negative under eaeh seenario. 
This suggests that, eonditionally on the propensity seore, the probability of being non-users is 
lower among treated units (eard-holding households) than untreated units (non eard-holding 
households), revealing a violation of the uneonfoundedness assumption due to S'-eonfounding. 
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Indeed we found the untreated eompliers most eonsist of a small group of households with 
high cash inventories. The span 1995-1998 emerges with the highest estimated value of Oq (the 
intercept of the model for S), implying a high proportion of non-users in the group of untreated 
households. Estimates of the CATEs and CATTs are consistently negative with small standard 
errors. The span 1995-1998 shows larger values of the two estimated causal effects, likely due 
to the detected higher S'-confounding. The estimated values of ao, a, ^ and in particular of our 
targeted estimand, CATT, remain stable across the range of t]c ^ 0, while the estimated CATEs 
are equal to those obtained under = 0 minus the alternative value of rjc. Tableshows that, 
for each of the six considered scenario, the estimated CATT is insensitive to the degree of 
unmeasured confounding within a reasonable range. The estimated CATTs for withdrawers 
are comparatively more stable, in particular the span 1998-2000 that shows values contained 
in the small range from -1547 to -1572 thousands EIT. The estimated CATTs for POS users are 
more sensitive to the proposed values of rj^ presumably because of the larger shares of non¬ 
users in comparison to withdrawers (Table [^. However, these values remain within plausible 
ranges, with the only exception of the CATT corresponding to span 1993-1995 and rfn = —400 
that is appreciably different to the CATTs obtained for the alternative values of r]n. The span 
1998-2000 shows again stabler results: -1573 to -1687 thousands EIT, presumably due to the 
smaller share of POS non-users in comparison to the other two spans. 

To better understand the scale of these results. Table [^reports the percentage reduction in 

CATT 

the cash inventories due to the use of the debit card, CATT ’ AOTC is the Aver¬ 

age Outcome for the Treated Compliers. Eor 1993-95 and 1998-00, the ratios are greater for 
POS users than for withdrawers. This is reasonable because individuals who used debit cards 
to pay at POS usually also used cards to withdraw cash at ATM. Eor 1995-98 the difference 
between POS users and withdrawers decrease in that people usually start to use debit cards to 
withdraw and subsequently to pay at POS, so that the longer the span the more likely that the 
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groups of POS users and withdrawers coincide. Overall the estimated reduction in household 
cash inventory due to the use of the debit card is remarkable, ranging between 78% and 81% 
for the span 1995-1998, between 75% and 78% for the POS users and between 67% and 73% 
for the withdrawers for the other two spans. 


Table 3: Estimated proportion of never-users in the group of treated, Pr(S' = n\Z = 1), 
from the model with the sensitivity parameter ^ in comparison to the model without sensitivity 
parameters and moment estimates. 




1993-1995 

1995-1998 

1998-2000 

POS users 

Model with sensitivity parameter ^ 

.933 

.780 

.690 


Model w/o sensitivity parameters 

.886 

.308 

.612 


Moment Estimate 

.933 

.783 

.690 

Withdrawers 

Model with sensitivity parameter ^ 

.301 

.297 

.140 


Model w/o sensitivity parameters 

.700 

.635 

.407 


Moment Estimate 

.299 

.300 

.140 


5 Discussion 

In this paper we quantify the causal effect of the use of debit cards on households cash inven¬ 
tories in Italy. A principal stratification model integrated with sensitivity parameters allows 
to simultaneously account for issues including the non-negligible share of households who 
hold one debit card but do not use it, the questionable definition of the use of cards as a treat¬ 
ment under the potential outcome approach, and possible violation of the unconfoundedness 
assumption. 
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Our results suggest eonsiderable eausal effeets: the reduetion on eash inventories for 
households who use the debit eard is eonsistently between 70 and 80 per eent during 1993- 
2000. We have evaluated short-term effeets here with only one to one and a half years long 
study period on average. In faet, the SHIW data does not provide information about the mo¬ 
ment a household has aequired its debit eards; we only know it has happened during the two, 
or three, years of the eonsidered span. Therefore the high estimated effeets on eash holding 
also signal that the use of eards quiekly affeets the reduetion of the amount of eash held at 
home. 

Via the sensitivity analysis we have identified a high level of unmeasured eonfounding that 
otherwise would have biased the results. Souree for the eonfounding primarily lies in the part 
of eompliers who have high level of eash inventories even without possessing debit eards. In¬ 
deed jMercatantTand^ (2014) also shows debit eards holders generally have higher level of 
ineome, wealth and edueation of the householder in eomparison to households without debit 
eards. Therefore it is plausible that eompliers are in high soeial and eeonomie statuses. Quan- 
tifieation of the eausal effeets for a larger sub-population of eompliers that inelude households 
in a broad range of soeial elasses, eould be aehieved by extending the observed period, whieh 
would reduee the proportion of never-users. In faet, it is plausible that less reaetive households 
would start to use the eards over time. However, this analysis is not feasible with SHIW data 
for two reasons: first, extended period of observations would greatly reduee the sample sizes; 
seeond, given the inereased diffusion of debit eards, the eontrol group size would eollapse over 
time. Nonetheless, one eould still apply the same eausal model to suitable datasets that allow 
for extending the temporal effeets of debit eards while maintaining an adequate sample size 
for the untreated group. 

Our sensitivity model for the outeome is not identifiable from a frequentist perspeetive, 
with only the sum of the two but not individual sensitivity parameters identifiable. From a 
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Bayesian perspective, the model is weakly identifiable given proper prior distributions for the 
parameters. When available, a secondary outcome variable modeled jointly with the primary 
outcome would sharpen the analysis ( Mealli and Pacinif 2013; Mercatanti et a/.[|2015| ) regard¬ 
less of the mode of inference. Searching for a suitable auxiliary outcome is a direction of our 
future research. 
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Table 4: MLE of ao, a, 6 (CATE) and CATT when = 0 and for fixed values of rj^. CATE 
and CATT denominated in thousands of Italian Eira. Standard errors are in the parenthesis. 



Vn 

-400 

-200 

0 

200 

400 


do 

3.38 (.43) 

2.32 (.27) 

2.34 (.26) 

2.42 (.25) 

2.48 (.25) 

93-95 

d 

.35 (2.46) 

-1.52(1.40) 

-2.00(1.28) 

-2.86 (1.21) 

-3.52(1.17) 

POS users 


-.84 (.48) 

-.67 (.36) 

-.77 (.35) 

-.91 (.35) 

-1.03 (.35) 


CATE 

-2401.9 (291.4) 

-1708.6 (168.4) 

-1673.5 (158.2) 

-1617.5 (156.2) 

-1574.9(161.7) 


CATT 

-2429.6 (911.8) 

-1730.6 (460.5) 

-1698.1 (455.9) 

-1647.9 (450.1) 

-1609.0(471.8) 


do 

2.39 (.24) 

2.38 (.23) 

2.37 (.23) 

2.37 (.23) 

2.38 (.23) 

93-95 

d 

-1.84(1.11) 

-1.97(1.08) 

-2.06(1.07) 

-2.14(1.06) 

-2.22(1.07) 

withdrawers 

? 

-2.80 (.24) 

-2.76 (.23) 

-2.74 (.22) 

-2.72 (.22) 

-2.71 (.22) 


CATE 

-1506.4 (58.7) 

-1493.6 (55.2) 

-1483.2 (53.4) 

-1484.3 (52.9) 

-1469.2 (53.4) 


CATT 

-1562.7 (229.5) 

-1551.3 (229.1) 

-1542.8 (216.9) 

-1536.4 (220.4) 

-1531.0 (217.9) 


do 

3.26 (.48) 

3.19 (.46) 

3.16 (.46) 

3.12 (.45) 

3.11 (.45) 

95-98 

d 

2.23 (1.13) 

1.79(1.07) 

1.64(1.05) 

1.31 (1.02) 

1.11 (.99) 

POS users 


-2.94 (.48) 

-2.68 (.43) 

-2.59 (.41) 

-2.42 (.39) 

-2.33 (.39) 


CATE 

-2881.9 (207.2) 

-2767.1 (182.6) 

-2720.6 (174.6) 

-2637.0 (170.6) 

-2596.3 (174.3) 


CATT 

-2902.8 (479.5) 

-2805.6 (512.6) 

-2763.2 (477.2) 

-2691.6 (432.8) 

-2658.3 (493.8) 


do 

3.67 (.45) 

3.65 (.44) 

3.62 (.44) 

3.59 (.43) 

3.57 (.43) 

95-98 

d 

-.26 (.96) 

-.31 (.94) 

-.37 (.93) 

-.49 (.92) 

-.62 (.91) 

withdrawers 

1 

-4.41 (.43) 

-4.37 (.42) 

-4.32 (.41) 

-4.23 (.40) 

-4.15 (.39) 


CATE 

-2775.4(169.4) 

-2744.1 (159.5) 

-2706.7 (152.8) 

-2646.3 (147.3) 

-2592.1 (144.4) 


CATT 

-2739.6 (305.0) 

-2710.0 (423.9) 

-2674.9 (426.8) 

-2619.3 (433.0) 

-2571.0(403.4) 


do 

2.35 (.23) 

2.25 (.22) 

2.19 (.22) 

2.15 (.22) 

2.11 (.22) 

98-00 

d 

-1.89 (.93) 

-1.75(.91) 

-1.69(.90) 

-1.70 (.89) 

-1.87 (.88) 

POS users 

C 

-1.01 (.30) 

-.94 (.29) 

-.90 (.28) 

-.85 (.29) 

-.77 (.29) 


CATE 

-1785.4(137.5) 

-1761.4(126.0) 

-1739.1 (121.2) 

-1709.9(121.0) 

-1652.8 (63.6) 


CATT 

-1687.3 (391.0) 

-1655.1 (379.4) 

-1632.3 (375.5) 

-1608.3 (373.4) 

-1573.0 (366.0) 


do 

2.00 (.24) 

1.98 (.24) 

1.97 (.23) 

1.96 (.24) 

1.95 (.23) 

98-00 

d 

-.45 (1.02) 

-.42 (3S)2) 

-.41 (1.02) 

-.41 (1.02) 

-.43 (1.03) 

withdrawers 

e 

-3.69 (.34) 

-3.68 (.33) 

-3.67 (.32) 

-3.66 (.32) 

-3.65 (.33) 


CATE 

-1643.9 (72.6) 

-1635.6 (70.9) 

-1628.8 (69.7) 

-1623.4 (69.5) 

-1619.4 (69.9) 


CATT 

-1572.0 (244.6) 

-1563.1 (248.1) 

-1555.6 (239.6) 

-1550.9 (246.3) 

-1547.8 (244.2) 



Table 5: Percentage reduction in the cash inventories due to the use of the debit card, 
C ATT 

AQI^ CATT ’ ^ range of rj^ values. 



1 

O 

O 

rjn = -200 

?7n = 0 

Vn = 200 

= 400 

93-95: POS users 

-.832 

-.779 

-.776 

-.770 

-.766 

93-95: withdrawers 

-.678 

-.676 

-.675 

-.674 

-.673 

95-98: POS users 

-.813 

-.808 

-.806 

-.802 

-.797 

95-98: withdrawers 

-.793 

-.791 

-.789 

-.785 

-.782 

98-00: POS users 

-.758 

-.755 

-.752 

-.749 

-.745 

98-00: withdrawers 

-.730 

-.729 

-.728 

-.728 

-.727 
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